(1) GENERAL INFORMATION: 


(i) APPLICANT: Hewick, Rodney M. 

Wang, Jack H. 
Wozney, John M. 
Celeste, Anthony J. 

(ii) TITLE OF INVENTION: Bone and Cartilage Induct ve Proteins 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Legal Affairs, Genetic^ Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge / 

(D) STATE: MA /s > 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disky 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DO'S /MS -D£ 

(D) SOFTWARE: Patentln Re^ase^fl. 0, Version #1.25 

(vi) CURRENT APPLICATION DATA; 

(A) APPLICATION NUMBER/ US 07/800,364 

(B) FILING DATE: 26-N0V-1991 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kapinos /Ellen J. 

(B) REGISTRATION ^NUMBER: 32,245 

(C) REFERENCE/DOCKET NUMBER: GI 5182A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE/ 617-876-1170 

(B) TELEFAX: /617-876-5851 

(2) INFORMATION FOR /SEQ ID NOll: 

(i) SEQUENCE ^CHARACTERISTICS : 

(A) LENGTH : 23 amino acids 

(B) TYPE: amino acid 

(C) STOANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) /ORIGINAL SOURCE: 

(F) TISSUE TYPE: Bone 


42 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu As£ Trp 
15 10 X5 

Val He Ala Pro Gin Gly Tyr 
20 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single / ^ 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 
(F) TISSUE TYPE: Bone 

(xi) SEQUENCE DESCRIPTION: SfeQ ID NO: 2: 

Leu Ser Ala Thr Ser Val lieu Tyr Tyr Asp Ser Ser Asn Asn Val He 
1 5 / 10 15 

Leu Arg 

(2) INFORMATION FOR SEQAd NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH y 7 amino acids 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) M0LECUIJ5 TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTlASENSE: NO 

(vi) OMfGINAL SOURCE: 

(k) ORGANISM: Bos taurus 
/(F) TISSUE TYPE: Bone 

(xi/ SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

La Cys Cys Ala Pro Thr Lys 
5 
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(2) INFORMATION FOR SEQ ID NO: 4: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Bos taurus 
(F) TISSUE TYPE: Bone 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Thr Asn Glu Leu Pro Pro Pro Asn 
1 5 

Val His Gly Ser His Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 5:, 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base^pairs 

(B) TYPE: nucleic Acid 

(C) STRANDEDNESS/ double 

(D) TOPOLOGY: lihear 

(ii) MOLECULE TYPE : /DNA (genomic) 
(iii) HYPOTHETICAL/ NO 
(iv) ANTI-SENSe/ NO 

(vi) ORIGINAL/SOURCE: 

(A) ORGANISM: Bos taurus 

(vii) IMMEDIATE SOURCE: 

(B) /CLONE: acc30 

(viii) PORTION IN GENOME: 
:) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 25.. 57 


Leu Pro Gly lie Phe Asp Asp 
10 15 


ci) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 


GGATCCGCGT GCTGTGCTCC GACC AAG 
/ Lys 
/ 1 


CTG AGC GCC ACC TCC GTG CTC TAC 
Leu Ser Ala Thr Ser Val Leu Tyr 

5 


51 


TAC GAC AGCAGCAACA ATGTAATTCT AGA 
Tyr Asp 
10 


44 


i 80 


v 

ft 

V 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (generic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos /Taurus 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Bovine genomic 

(B) CLONE: Lambda 9800-10 

(viii) POSITION IN GENOME: 

(C) UNITS: /bp 

(ix) FEATURE: 

(A) NAME^KEY: exon 

(B) LOgATION: 30.. 199 

(ix) FEAT 

(A) tfAME/KEY: intron 

(B) /LOCATION: 1..29 

(ix) FE^ 

0 NAME/KEY: CDS 
/(B) LOCATION: 30.. 179 


(xi/ SEQUENCE DESCRIPTION: SEQ ID NO: 7: 


TGCCCGCTGC CCCCTCCCGC CCCCGCCAG GTG CAC CTG CTG AAG CCG CAC GCG 
/ Val His Leu Leu Lys Pro His Ala 

/ 15 


53 


45 

GTC CCC AAG GCG TGC TGC GCG CCC ACC AAG CTG AGC GCC ACT TCC GTG 
Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val 
10 15 20 

CTC TAC TAC GAC AGC AGC AAC AAC GTC ATC CTG CGC AAG CAC CGC 
Leu Tyr Tyr Asp Ser Ser Asn Asn Val lie Leu Arg Lys His Arg asn 
25 30 35 / 40 

ATG GTG GTC CGC GCC TGC GGC TGC CAC TGAGGCCCCA ACTCCACCGC 
Met Val Val Arg Ala Cys Gly Cys His 

45 50 

CAG 


(2) INFORMATION FOR SEQ ID NO: 8 


\ 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ 

Val His Leu Leu Lys Pro His Ala Val 
1 5 


Cys Cys Ala Pro 
15 


Thr Lys Leu Ser Ala Thr Ser Val 
20 


Tyr Tyr Asp Ser Ser Asn Asn 
15 30 



101 


149 


196 


199 


Val lie Leu Arg Lys His Arg Asrf Met Val Val Arg Ala Cys Gly Cys 
35 ¥0 45 

His 


(2) INFORMATION FOR SEQ /LD NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:/ 172 base pairs 

(B) TYPE: /ucleic acid 

(C) STRANE&DNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULET TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIg/naL SOURCE: 

(A/ ORGANISM: Bos taurus 

(vii) IM&EDIATE SOURCE: 

(A) LIBRARY: Bovine genomic 

(B) CLONE: Lambda 9800-10 

(viii)/ POSITION IN GENOME: 

(C) UNITS: bp 
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(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 51.. 161 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1..50 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 162.. 172 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 51.. 161 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9j 
GGGGTGGGAG GGCACGTGGA TGGGACTCAC CTTCTCC 


KC TACCCCCCAG GAC TGG 
Asp Trp 
1 


56 


GTC ATC GCC CCC CAA GGC TAC TCA GCC/TAT TAC. 
Val lie Ala Pro Gin Gly Tyr Ser Aly Tyr 
5 10 


GAA GGG GAG TGC 
Cys Glu Gly Glu Cys 
15 


104 


TCC TTC CCG CTG GAC TCC TGC ATG, 
Ser Phe Pro Leu Asp Ser Cys Me J 
20 25 

CAG TCC CTG GTCAGTACCT C 
Gin Ser Leu 

35 


iC GCC ACC AAC CAC GCC ATC CTG 
Ala Thr Asn His Ala lie Leu 
30 


152 


172 


(2) INFORMATION FOR SEgf ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp. Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly 
1/5 10 15 

Glu Cys Se/ Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala 
20 25 30 

lie Leu jfcln Ser Leu 
35 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 

(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Bovine genous 

(B) CLONE: Lambda 9800-10 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 20.. 99 

( ix) FEATURE : 

(A) NAME/KEY: intron 

(B) LOCATION: 1..19 

( ix) FEATURE : 

(A) NAME/KEY: intttz 

(B) LOCATION: 100/. 119 

( ix) FEATURE : 

(A) NAME/KEY: /IDS 

(B) LOCATION :/2 2. .99 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCCTTGCGTG TCCCCGQAGA C GAC GTC CAC GGC TCC CAC GGC CGG CAG GTG 

Asp Val His Gly Ser His Gly Arg Gin Val 
15 10 

TGC CGT CGG CAC GAG CTG TAC GTG AGC TTC CAG GAC CTG GGC TGG CTG 
Cys Arg Arg qis Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu 

15 20 25 

gtgagttccg/actctccttt 


51 


99 


119 


(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12; 
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Asp Val His Gly Ser His Gly Arg Gin Val Cys Arg Arg His Glu Leu 
1 5 10 15 

Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human Heart. 


/ 


y 


I 


(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Human heart 

#936208 

(B) CLONE: hH38 

(viii) POSITION IN GENOME: 

(C) UNITS : bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8./850 


DNA librarvjstfatagene catalog 


(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 


aatjeptide 
127. ,843 


(ix) FEATURE: 

(A) NAME/KfcY: mRNA 

(B) LOCATION: 1..997 


(xi) SEQUENC 

GAATTCC GAG 
Glu 
-139, 

CCG GCT GGC 
Pro Ala G] 
-125 


DESCRIPTION: SEQ ID NO: 13: 


-135 -130 


-120 


-115 


-105 


-100 


GTG 
Val 


-90 


-85 


-80 


CAG 

ATC 

Gin 

He 

AAG 

GTG 

Lys 

Val 


-110 

TTC 

CAG 

Phe 

Gin 

-95 


TTG 

GAT 

Leu 

Asp 


49 


97 


145 


193 


49 

CTT CAG ACG CTC CGA GCT GGA GAC GAG GGC TGG CTG GTG CTG GAT GTS 
Leu Gin Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Ve 
-75 -70 -65 

ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG CGT CAC AAG GAC CZ*G GGA 
Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp JLen Gly 
-60 -55 -50 

CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC GTG (SAT CCT GGC 
Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val/Asp Pro Gly 
-45 -40 -35 / -30 

CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC/&AA CAG CCT TTC 
Leu Ala Gly Leu Leu Gly Gin Arg Ala Pro Arg Se/ Gln^Gln Pro Phe 
-25 -20 / / / -15 


241 


289 


337 


385 


GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC 
Val Val Thr Phe Phe Arg Ala Ser Pro Ser V\ 
-10 -5 


(TC CGC ACC CCT CGG 
lie Arg Thr Pro Arg 
1 


433 


GCA GTG AGG CCA CTG AGG AGG AGG CAG CCf? AAG AAA AGC 
Ala Val Arg Pro Leu Arg Arg Arg Gin I>ro Lys Lys Sf 
5 10 


SAG CTG 
Vsn Glu Leu 


481 


CCG CAG GCC AAC CGA CTC CCA GGG ATC^KP^GAT 
Pro Gin Ala Asn Arg Leu Pro Gly Pie Phe Asp 
20 25 / 30 

CAC GGC CGG CAG GTC TGC CGT C<& CAC GAG CTC 
His Gly Arg Gin Val Cys Arg Rrg His Glu Leu 

40 / 45 


GAC GTC CAC GGC TCC 
Asp Val His Gly Ser 

35 

TAC GTC AGC TTC CAG 
Tyr Val Ser Phe Gin 
50 


529 


577 


GAC CTT GGC TGG CTG GAC TGG GTC ATC GCC CCC 

Asp Leu Gly Trp Leu Asp 2?rp Val lie Ala Pro 
55 / 60 

TAT TAC TGT GAG GGG GZ(g TGC TCC TTC CCG CTG 

Tyr Tyr Cys Glu Gly (Slu Cys Ser Phe Pro Leu 
70 / 75 


CAA GGC TAC TCA GCC 
Gin Gly Tyr Ser Ala 
65 

GAC TCC TGC ATG AAC 
Asp Ser Cys Met Asn 
80 


625 


673 


GCC ACC AAC CAC GCfc ATC CTG CAG TCC CTG GTG 
Ala Thr Asn His Kla lie Leu Gin Ser Leu Val 
85 / 90 


CAC CTG ATG AAG CCA 
His Leu Met Lys Pro 
95 


721 


AAC GCA GTC COC AAG GCG TGC TGT GCA CCC ACC 
Asn Ala Val Bfro Lys Ala Cys Cys Ala Pro Thr 
100 / 105 110 


AAG CTG AGC GCC ACC 
Lys Leu Ser Ala Thr 
115 


769 


TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC 
Ser Val Lfu Tyr Tyr Asp Ser Ser Asn Asn Val 
120 125 


ATC CTG CGC AAG CAC 
lie Leu Arg Lys His 
130 


817 


CGC AAC/ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC 
Arg Asp Met Val Val Lys Ala Cys Gly Cys His 
135 140 


870 


TACTGCAGCC ACCCTTCTCA TCTGGATCGG GCCCTGCAGA 
TGTCACAGCT CAAGCAGGAG TGTCAGGGGC CCTCACTCTC 


GGCAGAAAAC CCTTAAATGC 930 
GGTGCCTACT TCCTGTCAGG 990 


50 


CTTCTGGGAA TTC 


1003 


(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14/ 

Glu Pro His Trp Lys Glu Phe Arg Phe Asp LeuTThr Gin lie Pro Ala 
-139 -135 -13</ ^ J -125 


Gly Glu Ala Val Thr Ala Ala Glu Phe Ai 
-120 -115 

lie His Leu Leu Asn Arg Thr Leu His^ 
-105 -100 

Gin Glu Gin Ser Asn Arg Glu Ser 
-90 -85 


IB Ty/ Lys Val Pro Ser 
-110 

/al Ser IJefPhe Gin Val Val 
-95 

Tu Phe Phe Leu Asp Leu Gin 
-80 


Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala 
-75 -70 / -65 -60 

Ala Ser Asp Cys Trp Leu Le<u Lys Arg His Lys Asp Leu Gly Leu Arg 
-55 / -50 -45 

Leu Tyr Val Glu Thr Gl/ Asp Gly His Ser Val Asp Pro Gly Leu Ala 
-40 / -35 -30 

Gly Leu Leu Gly Gln/Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val 
-25 / -20 -15 

Thr Phe Phe Arg /la Ser Pro Ser Pro He Arg Thr Pro Arg Ala Val 
-10 / -5 1 5 

Arg Pro Leu A^g Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin 

10 15 20 

Ala Asn Arc/ Leu Pro Gly He Phe Asp Asp Val His Gly Ser His Gly 
25 30 35 

Arg Gin /al Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu 
40, 45 50 

Gly T-/p Leu Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr 
55 60 65 

Cys/ Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr 
lp 75 80 85 

sn His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala 
90 95 100 


/ 



